Where Does the Alignment Score Distribution Shape Come from?
نویسندگان
چکیده
Alignment algorithms are powerful tools for searching for homologous proteins in databases, providing a score for each sequence present in the database. It has been well known for 20 years that the shape of the score distribution looks like an extreme value distribution. The extremely large number of times biologists face this class of distributions raises the question of the evolutionary origin of this probability law.WE INVESTIGATED THE POSSIBILITY OF DERIVING THE MAIN PROPERTIES OF SEQUENCE ALIGNMENT SCORE DISTRIBUTIONS FROM A BASIC EVOLUTIONARY PROCESS: a duplication-divergence protein evolution process in a sequence space. Firstly, the distribution of sequences in this space was defined with respect to the genetic distance between sequences. Secondly, we derived a basic relation between the genetic distance and the alignment score. We obtained a novel score probability distribution which is qualitatively very similar to that of Karlin-Altschul but performing better than all other previous model.
منابع مشابه
Whither Mental Health Policy-Where Does It Come from and Does It Go Anywhere Useful?; Comment on “Cross-National Diffusion of Mental Health Policy”
Factors influencing cross-national diffusion of mental health policy are important to understand but complex to research. This commentary discusses Shen’s research study on cross-national diffusion of mental health policy; examines the extent to which the three questions researched by Shen (whether countries are more likely to have a mental health policy (a) the earlier a country becomes a memb...
متن کاملImage Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...
متن کاملgpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences
Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...
متن کاملFACE ALIGNMENT USING BOOSTED APPEARANCE MODEL (Discriminative Appearance Model)
This thesis explores method of face alignment using Boosted Appearance Model (BAM). Like Active Appearance Model (AAM) we call our method as Boosted Appearance Model (BAM) since our appearnce model is trained by boosting. In this method, face alignment is done by maximizing the score of a trained two-classifer which is able to distinguish correct alignment and incorrect alignment, so that the c...
متن کاملAn Application of Non-response Bias Reduction Using Propensity Score Methods
Normal distribution is widely used in many applications. The problem of testing whether observations come from a normal distribution has been studied extensively by many researchers. Our main goal in this article is to present a simple test procedure for testing multivariate normality.
متن کامل